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ABSTRACT 

We use a sample of 87 rest-frame ultraviolet-selected star-forming galaxies with mean spectroscopic 
redshift (z) = 2.26±0.17 to study the correlation between metallicity and stellar mass at high redshift. 
Using stellar masses determined from spectral energy distribution fitting to U n GlZJK s (and Spitzer 
IRAC, for 37% of the sample) photometry we divide the sample into six bins in stellar mass, and 
construct six composite Ha + [N II] spectra from all of the objects in each bin. We estimate the mean 
oxygen abundance in each bin from the [N II] /Ha ratio, and find a monotonic increase in metallicity 
with increasing stellar mass, from 12 + log(0/H) < 8.2 for galaxies with (M*) = 2.7 x 10 9 M Q to 
12 + log(0/H) = 8.6 for galaxies with (M*) = 1.0 x 10 11 M Q . The mass-metallicity relation at 
z ~ 2 is offset from the local mass-metallicity relation by ~ 0.3 dex, in the sense that galaxies of 
a given stellar mass have lower metallicity at high redshift. A corresponding metallicity- luminosity 
relation constructed by binning the galaxies according to rest-frame B magnitude shows no significant 
correlation. This lack of correlation is explained by the known large variation in the rest-frame optical 
mass-to-light ratio at z ~ 2, and indicates that the correlation with stellar mass is more fundamental. 
We use the empirical relation between star formation rate density and gas density to estimate the gas 
fractions of the galaxies, finding an increase in gas fraction with decreasing stellar mass. The median 
gas fraction is more than two times higher than that found in local star-forming galaxies, providing 
a natural explanation for the lower metallicities of the z ~ 2 galaxies. These gas fractions combined 
with the observed metallicities allow the estimation of the effective yield y e g as a function of stellar 
mass; in contrast to observations in the local universe which show a decrease in y c ff with decreasing 
baryonic mass, we find a slight increase. Such a variation of metallicity with gas fraction is best fit by 
a model with supersolar yield and an outflow rate ~ 4 times higher than the star formation rate. We 
conclude that the mass-metallicity relation at high redshift is driven by the increase in metallicity as 
the gas fraction decreases through star formation, and is likely modulated by metal loss from strong 
outflows in galaxies of all masses. Our ability to detect differential metal loss as a function of mass 
is limited by the small range of baryonic masses spanned by the galaxies in the sample, but there is 
no evidence for preferential loss of metals from low mass galaxies as has been suggested in the local 
universe. 

Subject headings: galaxies: abundances — galaxies: evolution — galaxies: high-redshift 



1. INTRODUCTION 

Correlations between mass and metallicity or lumi- 
nosity and metallicity are well-established in nearby 
galaxies, ranging over orders of magnitude in mass and 
luminos ity and spanni n g ~ 2 dex in chemical abun- 
dance. iLeaueux et alJ l)1979t) first observed a correla- 
tion between heavy element abundance and the total 
mass of galaxies; since then, most investigations have 
focused on the metallicity-luminosity relationship (e.g. 
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only a few), though others have studied c orrelations be- 
tween metallicity an d rotational velocity ijZaritskv et alJ 
I19941 iCarnettl 120021) . The relationship between metal- 
licity and stellar mas s has recently been quantified by 
iTremonti et alJ l|2004 T04 hereafter), using a sample of 
~ 53, 000 galaxies from the Sloan Digital Sky Survey 
(SDSS). Such correlations provide insight into the pro- 
cess of galaxy evolution, as they allow the study of the 
history of star formation and gas enrichment or deple- 
tion through current, observable properties. Chemical 
enrichment is a record of star formation history, mod- 
ulated by inflows and outflows of gas, while the stellar 
mass provides a more straightforward measure of the ac- 
cumulated conversion of gas into stars and thus of the 
metals returned to the gas as the byproducts of star for- 
mation. 

It has long been recognized that a correlation be- 
tween stellar mass and gas phase metallicity is a nat- 
ural consequence of the conversion of gas into stars in 
a closed system llvan den Berghl ^62; Sc hmidtl Il963t 
iSearle k, SargentJl9 72'). The principal ingredients of this 
"simple" or "closed-box" model are the metallicity, the 
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yield from star formation (denned as the mass of metals 
produced and ejected by star formation, in units of the 
mass that remains locked in long-lived stars and rem- 
nants), and the gas fraction. If there are no inflows or 
outflows of gas, the metallicity is a simple function of 
the yield and the gas fraction, and rises as the gas is 
converted into stars and enriched by star formation ac- 
cording to the yield. The model is subject to the further 
assumptions that the system is well-mixed at all times, 
that it begins as pure gas with primordial abundances, 
that stellar evolution and nucleosynthesis take place in- 
stantaneously compared to the timescale of galactic evo- 
lution (the instantaneous recycling approximation), and 
that the IMF and the yield in primary elements of stars of 
a given mass are constant. This simple model, in com- 
bination with the observation tha t lower mass galaxies 
tend to have higher gas fra ctions l)McGaueh fc de Blokl 
119971 iBell fe de JondkoOCft . results in a correlation be- 
tween metallicity and total mass. 

One of the first, best-known failures of the simple 
model is the so-called "G-dwarf problem" : the closed- 
box model overpredicts the number of low-metallicity 
stars observed in the solar neighborhood. This is one 
aspect of the more fundamental problem that galaxies 
are clearly not closed boxes. On the one hand, infall 
and mergers are essential aspects of galaxy formatio n 
(e.g. iPagel fc Patched Il975t iNaab fc Ostrikerl [2005) . 
On the other, galactic-scale winds driven by star for- 
mation (a process generally referred to as "feedback" 
from star formation activity) are a ubiquitous feature 
of starburst galaxies at low (e.g. iHeckman et al]ll990t 
[ Lehnert fc Heckmanlll996HMartinll999HStrickland et al l 
[2QQJ, among many others) and high l | Pettini et alJl2001t 
IShaplev et atll20M iSmail et all 12003ft redshifts. Met- 
als are detected in the intergalacti c medium (IGM; 
lEllison et al"ll2000t Isimcoe et alJl2004ft at high redshifts, 
and, at z = 2-3, their locations are strongly cor- 
related with the distribution of observed star-formin g 
galaxies j|Adelberger et alJl2003t lAdelbereer et al.ll2005ft . 
The potential of such supernova-powered winds to ex- 
pel gas from galaxies and thus modify their chemica l 
evolution has been known for some time; iLarsonl l)1974ft 
showed how winds could account for the mass-metallicity 
relation in elliptical galaxies by preferentially ejecting 
metals from those with lower masses. This provides 
an alternative or additional explanation for the mass- 
metallicity correlation, and its appeal has increased in 
recent years as the physical evidence of feedback has 
multiplied and models of g alaxy formation have recog- 
nized its imp ortan ce (e.g. | Hern quis^ fc gpringel) I2QQ 3j 
i Benson et aTll2003l iDekel fc Wodl2003t iNagamine et alJ 
2004: IMurrav et al.ll200^ r 

Motivated by these competing or complementary the- 
ories of the origin of the mass-metallicity relation, T04 
use the statistical power of ~ 53, 000 star-forming SDSS 
galaxies to revisit the correlation, confirming its exis- 
tence over ~ 3 orders of magnitude in stellar mass and 
1 dex in metallicity. Using estimates of the effective 
yield as a function of baryonic mass, they see evidence 
for metal depletion in low mass galaxies, finding that a 
galaxy with baryonic mass M ~ 4 x 10 9 M loses half 
its metals, while low-mass dwarf galaxies are five times 
more metal depleted than L* galaxies. They also find 
that metal depletion occurs in galaxies with masses as 



high as 10 10 M©, and interpret their results as a signa- 
ture of winds from an early starburst phase. 

Given the strength of the mass-metallicity correlation, 
and its plausible origin from either metal loss through 
winds or the change in gas fraction as gas is converted 
to stars (in combination with higher gas fractions in low 
mass galaxies), it is not unreasonable to suppose that it 
may be present in galaxies at high redshift. This has been 
difficult to test, however. The strong optical emission 
lines usually used to determine metallicities shift into the 
infrared past z ~ 1, making the required large samples 
of spectra much more difficult to acquire. As luminosity 
can be determined with considerably greater ease than 
stellar mass, the first efforts focused on the luminosity- 
meta l licity relation at high redshift j Kobuln ickv fc Kool 
121)001 iPettini et all l200ll IShaplev et all 12004ft . finding 
that galaxies at z > 2 are ovcrluminous f or their metal- 
licities when compared to local galaxies. IShaplev et alJ 
(|2004ft also found that galaxies with M* ~ 10 11 M have 
appproximately solar metallicity, while lower mass galax- 
ies have Z ~ 0.5Z Q . 

In this paper we study the relationships among stellar 
mass, luminosity and metallicity at z > 2, using a sample 
of 87 star-forming galaxies with Ha and [N II] spectra. 
We describe our sample selection, observations, and data 
reduction procedures in S|21 and discuss our methods of 
determining stellar mass and metallicity in In 21 we 
give our results, and in SjSJwe discuss their implications 
for the origin of the mass-metallicity relation. Our con- 
clusions are presented in We use a cosmology with 
H = 70 km s- 1 Mpc" 1 , Q m = 0.3, and Q A = 0.7; in 
such a cosmology, the universe at z — 2.26 (the mean red- 
shift of our sample) is 2.9 Gyr old, or 21% of its present 
age. For comparisons with solar metallicity, we use the 
most recent values of the solar oxygen abundance, 12 + 
log(O/H) = 8.66, and the solar metal mass fraction 
Z Q = 0.0126 l|Asplund et alJl200^1 . 

2. SAMPLE SELECTION, OBSERVATIONS, AND DATA 
REDUCTION 

The galaxies discusssed in this paper are drawn from a 
sample of 114 galaxies with Ha spectra described in de- 
tail bv lErb et alJ l)2006albft . The galaxies were selected 
by their rest-frame UV colors, and their redshifts were 
confirmed with rest-frame UV spectra from the LRIS-B 
spectrograph on the 10-m Keck I t elescope; an overview 
of the z ~ 2 sample is given bv ISteidel et"aD ll200l . 
We then obtained near-IR spectra for a subset of these 
galaxies. Galaxies were chosen for near-IR spectroscopy 
for a wide variety of reasons, the end result of which is a 
sample very similar to the full set of galaxies with spec- 
troscopic redshifts but somewhat biased toward objects 
that are bright in K or red in R — K or J — K. A de- 
tailed discussion of the sample selection and its relation 
to the larger sample of UV-selected galaxies is provided 
bv lErb et alJ l)2006aft . For measurements of metallicity in 
the present paper, we include all objects with Ha spec- 
tra and fiT-band magnitudes (most have J magnitudes 
as well, and 32 have also been observed at 3.6, 4.5, 5.4 
and 8.0 /mi with the IRAC camera on the Spitzer Space 
Telescope), except those with AGN signatures in either 
their rest-frame UV or optical spectra. 

The Ha spectra were obtained between May 2002 and 
September 2004 with the near-IR spectrograph NIR- 
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SPEC l|McLeaii et alll998|) on the Keck II telescope. For 
the redshifts of the galaxies presented here, Ha falls in 
the A-band; most observations were conducted with the 
N6 filter, which spans the wavelength range 1.558-2.315 
/im, and in low-dispersion mode, which provides a reso- 
lution of R ~ 1400. The data were redu c ed using stan - 
dard procedures described bv lErb et all (|2003L 12006a'). 
and flux-calibrated with reference to near-IR standard 
stars. 

The near-IR imaging was carried out wi th the Wide- 
field IR Camera fWIRC. IwTlson et alJ2003T) on the Palo- 
mar 5-m Hale telescope. We obtained ~ 9' x 9' images 
to K s ~ 22.5 and J ~ 24 in four fields, with a typical 
integration time of ~ 11 hours in each band per field. 
Data redu ction and photom etry were performed as de - 
scribed bv lErb et all l|2006aD and Shaplc v~et~aTI (|2005b). 
For a description of the mid-IR IRAC d a ta, reductions, 
and pho tome try. seelBarmbv et all l|2004|) , IShaplev et al.1 
fimm. a,ndlReddv et a.U 1200.1) . 

3. MEASUREMENTS 
3.1. Stellar Masses 

Stellar masses are determined by fitting model SEDs 
to the U n GTZJK (and IRAC, when present) pho- 
tometry, using the proc edure described i n detail by 
IShaplev et all (l2005bfl andlErb et all l|2006a|) . which uses 
the lBruzual fc Chariot! (120031) p opulation synthesis mod- 
els and the lCaTzeTti*etalTi|27)0ri) extinction law. We com- 
pare the model SEDs of galaxies with a variety of ages 
and amounts of extinction to our observed photometry, 
and obtain the star formation rate and stellar mass from 
the normalization of the best-fit model to the data. We 
try models with a constant star formation (CSF) rate 
and models in which the star formation rate smoothly 
declines with time, parameterized by SFR oc e _t / T , with 
r =10, 20, 100, 200, 500, 1000, 2000 and 5000 Myr. In 
practice, however, most star formation histories provide 
adequate fits to most objects, and we therefore use the 
CSF models unless one of the r models provides a sig- 
nificantly better fit. The b est-fit mod e ls for t he galaxies 
discussed here a re given bv lErb et alJ J2006al) . We use a 
iChabrierl fiMty IMF for the stellar masses and star for- 
mation rates, which results in stellar masses a nd SFRs 
1.8 tim es smaller than those computed using a lSaloeterl 
(1955) IMF. These stellar masses are the integral of the 
star formation rate over the lifetime of the galaxy, and 
so represent the total mass of stars formed rather than 
the mass in living stars at the time of observation. For 
our adopted IMF, the current living stellar mass is ~ 10- 
40% lower depending on the age of the galaxy. We use 
the total rather than current stellar mass because the rel- 
evant quantity for the simple chemical evolution models 
we apply in ^SJis the fraction of the initial gas mass that 
has been turned into stars. 

Uncertainties in the fitting are determined from a large 
number of Monte Carlo simulations in which the input 
photometry is varied according to the photometric er- 
rors. The simulations also take into account variations 
in the star formation history by treating r as a free pa- 
rameter with the possible values given above. As has 
often been noted for SEP modeling of t his sort (e.g. 
iPaoovich et"all 12001 IShaplev et aTTl200lL I2005H) . the 
stellar mass is the most securely-determined parameter. 



For the current sample, the mean fractional uncertainty 
(<jm^/M+) — 0.4. The sim ulations and their results are 
described in more detail bvlErb et alJ ll2006al). 

As discuss e d by iPapovich et alJ i|2001|) and 
Sha. plev et a.U ©OSb), one limitation of such mod- 
eling is the insensitivity of the data to faint, old stellar 
populations, which could be obscured by current star 
formation and thus lead to an underestimate of the 
stellar mass. We have estimated the magnitude of this 
effect by fitting a variety of two-component models to 
the observed SEDs, in which the light from the observed- 
frame AT-band and redward is constrained to come from 
a maximally old burst while a young population is fitted 
to the rest-frame UV residuals. Such models make 
little difference to the stellar masses of already massive 
galaxies, since their stellar populations already approach 
the maximum age allowed by the age of the universe 
at their redshift. The masses of low-mass galaxies can 
be increased by an order of magnitude by such models, 
but the models are generally a poor fit to the SED 
and result in star formation rates far higher than those 
determined by all other indicators, with an average 
SFR of ~ 900 Mq yr _1 . While we cannot rule out 
such models in individual cases, they are very unlikely 
to be correct on average, and we therefore consider it 
unlikely that we have underestimated the stellar masses 
of the low-mass galaxies by such a large factor. More 
general two-component models, in which the relative 
contributions from a maximally old population and a 
young burst arc allowed to vary, increase the stellar mass 
by a factor of a few at most. T he two-component models 
are described in more detail bv lErb et alJ l|2006a|) . 

As described below, for the purposes of determining 
metallicities we divide the sample into six bins by stellar 
mass, with 14 or 15 galaxies in each bin. The mean stellar 
mass in each bin ranges from 2.7 x 10 9 M Q to 1.1 x 10 11 
Mq . The means and standard deviations of the fitted pa- 
rameters for each bin are given in Tabled along with the 
star formation rates determined from Ha luminosities. 
The Ha l uminosities have been corrected for extinction 
using the Calzetti et alJ l)200 0) extinction law and the 
best-fit value of E{B — V) from the SED modeling, and 
we have applied a factor of two aperture correction de- 
termined from the comparis on of the NIRSPEC spectra 
and narrowband images fsee lErb et afll2006bl for details, 
and for a full discussion of the Ha-derived star formation 
rates and their implications). 

3.2. Metallicities 

The most direct way to determine the abundances of 
metals from the observed emission line fluxes in H II 
regions is through the measurement of the electron tem- 
perature T e . As the metallicity of the gas increases, the 
cooling through metal emission lines also increases, re- 
sulting in a decrease in T e . The ratio of the auroral (the 
transition from the second lowest to the lowest excited 
level) and nebular (the transition from the lowest excited 
level to the ground state) emission lines of the same ion is 
highly sensitive to the electron temperature, and there- 
fore the measurement of such pairs of lines has been the 
preferred method of determining abundances in H II re- 
gions. However, the auroral lines (in particular the most 
widely used line, [O III]A4363) become extremely weak 
at metallicities above ~ 0.5 solar, and undetectable at 
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all metallicities in the low S /N spectra of distant galax- 
ies. In most cases, therefore, we must use the empirical 
"strong line" abundance indicators, which are based on 
the ratios of collisionally excited forbidden lines to hy- 
drogen recombination lines. 

These are calibrated with reference to the T e method 
or, more commonly, with detailed photoionization mod- 
els. The strong line methods carry significant hazards, 
however. Substantial biases and offsets are observed be- 
tween abundances determined with different methods, 
and between different calibrations of the same metho d 
re.g.lKennicutnt^l^nMlKobulnickv fc Kewlevll200^. 
and many of the strong line indicators are sensitive to 
the ionization parameter as well as metallicity. The ad- 
vent of large telescopes, sensitive detectors, and spec- 
trographs with high throughput has enabled the mea- 
surement of abundances with the T e method in an in- 
creasingly large sample of extragalactic H II regions. 
These new data suggest that the most widely used in- 
dicator, i? 23 = ([O II]A3727+[0 IIl]AA4959,5007)/H/3, 
may systematically overestimate metallicities in the 
high-abunda nce regime (Z > Z^) by as much as 
0.2-0.5 dex llKennicutt et alJl200l iGarnett et aI1l2nol 
iBresolin et al.l2005|) . The T c method itself is not without 
difficulties, however, as temperature gradients or fluctu- 
ations in the H II regions ma y affect abunda nce determi- 
nations at high metallicities ( Stasihska 2005) . The result 
of all this is that absolute values of metal abundances are 
still quite uncertain; but, fortunately for our present pur- 
poses, relative abundances of similar objects, determined 
with the same method, are more reliable. 

The available data limit the options for determining 
the chemical abundances of the z ~ 2 galaxies. NIR- 
SPEC can observe only one band in a single exposure, 
and at these redshifts Ha and [N II] A6584 fall in the K- 
band, [O III]AA5007,4959 and H/3 in the #-band, and 
[O II]A3727 in J. We have focused our observations 
on Ha in the -fT-band, so obtaining the additional data 
needed to use the R23 indicator would triple the required 
observing time. Our only option to determine the metal- 
licity of the vast majority of the galaxies in our sample 
is thus the ratio of [N II] to Ha. In addition to minimiz- 
ing the observing time required to obtain a large sample, 
this ratio has the advantages that it is insensitive to red- 
dening and is not affected by the relative uncertainties in 
flux calibration of spectra taken in different bandpasses. 

The use of N2 = log [N HlA6 584/Ha as an abundance 
indica tor was proposed by IStorchi-Bergmann et al.1 
ill 994). and has been fu r ther discussed and re fined 
bv IRaimann et all i2000j) . IDenicolo etHl (|2002|) and 
IPettini fc Paeell (|2004T l~The [N II] /Ha ratio is affected 
by metallicity in two ways. First, there is a tendency for 
the ionization parameter of an H II region to decrease 
with increasi ng metallicity (see, for example, Figure 3 of 
IDopital 120(351) ■ thereby increasing the ratio [N II] /[N III] 
and hence [N II] /Ha. Second, nitrogen has both a pri- 
mary component whose abundance varies at the same 
rate as other primary elements, and a secondary compo- 
nent which increases in abundance with increasing metal- 
licity, further raising the [N II] /Ha ratio. Using elec- 
tron temperature measurements to deter mine ionic abun- 
dance s in 20 extragalactic H II regions, IKennicutt et al.l 
(2003) find that the nitrogen abundance is adequately 
described by a simple model with a primary compo- 



nent with constant log(N/0) = —1.5 and a secondary 
component for which log(N/0) = log(0/H) + 2.2. In 
other words, secondary nitrogen becomes important for 
12 + log(0/H) > 8.3; unless we have significantly over- 
estimated the metallicities of the galaxies in our sample, 
most of our objects are in this regime. As emphasized by 
iKewlev fc Dopital l)2002[K a drawback of the N2 indica- 
tor is its sensitivity to the ionization parameter as well as 
to metallicity. In addition, the [N II] /Ha ratio cannot be 
used to determine metallicities above approximately so- 
lar, at which the N2 index saturates as nitrogen becomes 
the dominant coolant. 

Finally, the [N II] /Ha ratio is sensitive to contamina- 
tion from AGN and shock excitation, and if possible these 
should be ruled out before using it as a metallicity indica- 
tor. This is generally done through a diag nostic line ratio 
diagram such as [O III] /H/3 vs. [N I I] /Ha (Baldwi n~et~aTI 
119811: IVeilleux fc Osterbrocklll987[k on such a diagram, 
normal galaxies and AGN fall in generally well-defined 
regions. Unfortunately we lack the data to place nearly 
all of our objects on such a diagram; because we have so 
far obtained very few [O III] /H/3 spectra, we have mea- 
surements of all four lines for only four galaxies. The 
ratios [O III] /H/3 vs. [N II] /Ha for these galaxies are 
shown in Figure^ along with ~ 96, 000 objects from the 
SP SS (small grey points ), the local star bursts analyzed 
by IKewlev et al.1 (|2001bf) (small bla ck points), and the 
z = 2. 2 galaxy (green x) discussed bv lvan Dokkum et alJ 
(2005), which shows evidence of shock ionization or an 
AGN and which falls in a clearly different region of the 
diagram than our galaxies. The dashed blue line shows 
the maximum theoretical star burst line dete r mined from 
photoionization modeling bv IKewlev et alJ l|2001aj) : for 
realistic combinations of metallicity and ionization pa- 
rameter, models of normal starbursts fall below and to 
the left of this line. The lower blue dotted line is a simi- 
lar, empirically deter mined classification line derived for 
the SDSS objects bv lKauffmann et all l)2003|) . 

Our galaxies fall between the two lines, along a se- 
quence with a higher [O III] /H/3 ratio at a given 
[N II] /Ha ratio (or a higher [N II] /Ha ratio for a given 
[O III] /H/3 ratio) compared to the SDSS galaxies 8 . A 
similar offset is seen i n star- forming galaxies at z ~ 1.4 
(jShaplev et al.ll2005cf The SDSS contains very few ob- 
jects in this region, indicating that there are important 
physical differences between the local and high redshift 
samples. One plausible way to produce such a shift 
in the [N II]/Ha-[0 III] /H/3 diagram is through some 
combination of a harder ionizing spectrum and an in- 
crease in electron density. Though they are weak, the 
density-sensitive [S II] lines in the composite spectra in- 
dicate an average electron density of n e ~ 500 cm~ 3 
(with no dependence on stellar mass), higher than that 
found in normal local g alaxies but similar t o densities 
seen i n local starbu r sts (IKewlev et al.ll200 1b). Further- 
more, IKewlev et all ll2001aj) show that a relatively hard 
EUV radiation field is required to model the line ratios 

8 We have not corrected the H/3 fluxes for stellar absorption, but 
we do not expect this to be a significant effect. We expect the 
stellar H/3 absorption line to have an equivalent width W & ^ s ^ — 5 
A IKewlev et alKOOlbtlkobulnickv et alJ1999l). whil e, assuming a 
typical ratio of Ww™ IWwa ~ 5 TLeittierer et alJ 1999l. our galaxies 
have Wh/3 ~ 50 A. 
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of local starbursts; note that a shift is also observed be- 
tween the local starbursts and the SDSS sample. De- 
tailed photoionization modeling is required to determine 
quantitatively the origin of this shift and its effect on the 
calibration of the N2 index, and for the moment the ab- 
solute calibration of the metallicity scale remains uncer- 
tain (though the relatively shallow slope of the relation 
between N2 and (O/H) means that offsets are unlikely to 
be large) . As diagnostic line ratios for more high redshift 
galaxies are measured, it is hoped that full photoioniza- 
tion modeling, with improved spectra of the Wolf-Rayet 
stars that dominate the EUV radiation field, will clar- 
ify the physical reasons at the root of the shifts in the 
[N II]/Ha-[0 III]/H/3 diagram. 
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Fig. 1 — The [O III]/H/3 vs. [N II] /Ha diagnostic diagram. The 
four galaxies in our sample for which we have measurements of all 
four lines ar e shown by the large red circles. The z = 2.2 galaxy 
discussed bv lvan Dokkum ct al. ( 2005), which shows evidence of an 
AGN or shock ionization by a wind, is shown by the green X . The 
small grey points represent ~ 96, 000 objects from the SDSS, and 
the small black point s are the local starburst galaxies studied by 
IKewlev et alj 12001b]) . The dashed line show s the maximum theo- 
retical starburst line of Kcwlcv ct al. ( 2001a); for realistic combina- 
tions of metallicity and ionization parameter, star-forming galaxies 
fall below and to the left o f this line. The do t ted li ne is a similar, 
empirical determination by Kauffman n et all 1200 3V 



We can exclude an AGN contribution to our sample 
as a whole based on the X-ray and UV pro perties of 
the galaxies. Specifically, iReddv et al.1 l)2005|) find that 
only 3% of UV-selected galaxies at z ~ 2 have X-ray 
detections in the ultra-deep 2 Ms Chandra Deep Field 
North. These galaxies, almost all of which have K s < 20, 
are not included in the composite spectra considered in 
the present analysis. Further information on AGN con- 
tamination comes from the rest-frame UV spectra of our 
galaxies, which allow the rejection of AGN on an indi- 
vidual basis in fields without deep X-ray data. As de- 
scribed further in jjfl we have constructed six compos- 
ite UV spectra, binned by stellar mass in the same way 
as the NIRSPEC spectra (see below), as well as larger 
composites of the two highest and two lowest mass bins. 
None of the composites show AGN features such as broad 
and/or high ionization emission lines, placing further lim- 
its on low-level AGN activity. On the basis of all of these 



considerations, we conclude that a significant AGN com- 
ponent to the galaxies considered here is unlikely, and 
proceed under the assumption that the emission lines we 
observe are produced in H II regions photoionized by hot 
stars. 

This work is primarily concerned with relative, average 
abundances determined from composite spectra (see be- 
low) , which can be determined with more accuracy than 
the metallicities of individual objects; this mitigates the 
uncertainties ass ociated with the N2 m ethod. We use 
the calibration of lPettini fc Paeel l)2004|) . which is based 
on H II regions whose ionic abundances have been de- 
termined from the electron temperature or from detailed 
photoionization modeling. From a sample of 137 such 
H II regions, 131 of which have abundance s determined 
with the T e method, iPettini fc Pagell l|2004ft find 



12 + log(0/H) = 8.90 + 0.57 x N2, 



(1) 



with a 1(T dispersion of 0.18 dex. They conclude that the 
N2 calibrator allows a determination of the oxygen abun- 
dance to within a factor of ~ 2.5 with 95% confidence, 
an accuracy comparable to that of the R23 method. 

A further difficulty of the N2 method is that the [N II] 
line is weak, and is generally detected in the spectra of in- 
dividual objects i n our sample only w hen they approach 
solar metallicity ijShaplev et alJl2~004|) or have especially 
strong line fluxes. In order to increase the S/N and im- 
prove the likelihood of detecting [N II] at lower metallic- 
ities, we have divided the sample into six bins by stellar 
mass, with 14 or 15 galaxies in each bin, and constructed 
a composite Ha -I- [N II] spectrum of the objects in each 
bin. The use of composite spectra has the advantage 
that, in addition to increasing S/N, it minimizes the ef- 
fects of the mass and metallicity uncertainties of indi- 
vidual objects, because we are only concerned with the 
average properties of the galaxies in each bin. We first 
shift the flux-calibrated spectra into the rest-frame, and 
then average them, rejecting the minimum and maxi- 
mum value at each dispersion point to suppress noise 
from large sky subtraction residuals in individual spec- 
tra. The six composite spectra, labeled with the mean 
stellar mass in each bin, are shown in Figure |3 

We measure the [N II] /Ha ratio by first measuring the 
Ha fluxes, central wavelengths, and widths, and then 
constraining the [N II] line to have the same width and 
a central wavelength fixed by the position of Ha. We 
use the rms of the spectrum between emission lines to 
determine the typical noise in each spectrum; because 
the galaxies are at different redshifts, the systematic ef- 
fects of the night sky lines are minimized in the compos- 
ites, and the rms provides an adequate description of the 
noise. This procedure provides a good fit to the [N II] 
line for all of the composite spectra (except in the lowest 
mass bin, where we determine an upper limit on the [N II] 
flux). The measured Ha and [N II] fluxes for each com- 
posite, and the inferred value of 12+log(0/H), are given 
in Table [3 The listed uncertainties in 12+log(0/H) in- 
clude the scatter in the N2 calibration (a la uncertainty 
of 0.18 dex reduced by yN, where N is the number of 
objects in the composite spectrum) as well as the uncer- 
tainties in the measurements of the Ha and [N II] fluxes. 

4. THE MASS-METALLICITY RELATION 
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Fig. 2. — The composite NIRSPEC spectra of the 87 galaxies in 
our sample, divided into six bins of 14 or 15 objects each by in- 
creasing stellar mass (panels a through f). The spectra are labeled 
with the mean stellar mass in each bin, and the Ha, [N II], and 
[S II] lines are marked with dotted lines (left to right respectively). 
The increase in the strength of [N II] with stellar mass can be seen 
clearly. The density-sensitive [S II] lines, while weak, indicate a 
typical electron density of n e ~ 500 cm -3 , with no significant de- 
pendence on mass; this is a value comp arable to that seen in local 
starburst galaxies (Kcwlev et al. 2001*bTl. 



Figure 13 shows the mean metallicity of the galaxies in 
each mass bin plotted against their mean stellar mass 
(large filled circles); there is a monotonic increase in 
metallicity with stellar mass. The vertical error bars 
show the uncertainty in 12 + log(0/H) from measure- 
ment uncertainties in the [N II] /Ha ratio, while the ad- 
ditional vertical error bar in the lower right corner shows 
the uncertainty due to the scatter in the N2 calibration. 
The horizontal bars show the range of stellar masses in 
each bin. The most massive galaxies in the sample have 
close to solar metall icities, a result found previously by 
iShaplev et all l)2004|) . who measured the [N II] /Ha ra- 
tio from individual spectra of the brightest objects with 
K s < 20. More typical galaxies with M* ~ 10 10 M have 
12 + log(0/H) ~ 8.4, while for the lowest mass objects 
we can only place an upper limit 12 + log(0/H) < 8.2, 
or (O/H) < 1/3 (O/H) . 

We have not found any systematic effects which could 
spuriously produce the clear observed correlation be- 
tween stellar mass and metallicity. Specifically, it is pos- 
sible that the value of M+ in the lowest mass bins has 
been underestimated if an older stellar population is al- 
ready in place in these galaxies (see the discussion in 
JQJ. This would have the effect of steepening the ob- 
served correlation, as the lowest mass bins in Figure |3] 
would move to the right by 0.3-0.5 dex, while the high 
mass bins would remain unaffected. The use of the in- 
tegrated star formation rate as the stellar mass rather 
than the mass in currently existing stars (see H3.ll) would 
also steepen and shift the correlation somewhat, with the 
lower mass points moving ~ 0.1 dex to the left and the 
upper mass points shifting left by ~ 0.2 dex. AGN con- 
tamination is highly unlikely to produce the correlation, 



given the low fraction of AGN in the sample. Variations 
in the ionization parameter are also unlikely to be cor- 
related with the assembled stellar mass. The ionization 
parameter of an H II region depends on the age of the 
ionizing cluster, whic h is very muc h less than the age of 
the galaxy (see, e.g.. lDooitall2005tk since each galaxy in 
our sample presumably contains many H II regions of 
different ages, the overall variation in ionization parame- 
ter from galaxy to galaxy should be small. There is also 
no dependence of electron density on galaxy mass; the 
density-sensitive [S II] lines in the composite spectra in- 
dicate an average density n e ~ 500 cm~ 3 , with no trend 
with stellar mass. We therefore have no reason to expect 
the ionization parameter to depend on the total stellar 
mass (other than the dependence on metallicity, which is 
included in the N2 calibration). 

The dashed line in Figure shows the mass-metallicity 
relation determined for ~ 53, 000 star-forming SDSS 
galaxies by T04, after an arbitrary downward shift of 
0.56 dex. With this shift the SDSS relation matches the 
z ~ 2 galaxies remarkably well, though it is slightly shal- 
lower. The empirical shift of 0.56 dex includes an offset 
due to the different abundance diagnostics used in the 
two studies — while the SDSS metallicity determinations 
take into consideration all of the strong nebular lines, 
ours are based on the N2 index alone, for the reasons 
explained above. For a more consistent comparison, we 
use the N2 calibration to calculate the metallicities of 
the same 53,000 SDSS galaxies, shown with small grey 
points in Figure^ the mean N2 metallicity, in bins span- 
ning the same range of stellar masses used for our sam- 
ple, is shown by the small blue triangles. The saturation 
of the [N II] /Ha ratio at metallicities approaching so- 
lar (the horizontal dotted line) is clearly apparent in the 
SDSS sample, and makes the determination of the true 
offset between the two samples difficult. For galaxies 
with M* ~ 2 x 10 9 M , the oxygen abundance of the 
z ~ 2 sample is lower by > 0.3 dex; at higher stellar 
masses the offset is smaller, due at least in part to the 
saturation of the [N II] /Ha ratio. Taking the two lowest 
mass bins as the most reliable indicators and assuming 
that the shape of the relation remains unchanged, we find 
that galaxies at z ~ 2 are ~ 0.3 dex lower in metallicity 
than galaxies of the same stellar mass today. We discuss 
the likely reasons for this offset, and its uncertainties, in 

4.1. Composite Ultraviolet Spectra 

Given the uncertainties in the absolute metallicity scale 
associated with the N2 index, it would be highly de- 
sirable to obtain independent abundance me asures for 
the ga laxies considered here. As discussed bv lRix et alJ 
( 2004) , the ultraviolet spectrum of star-forming galaxies 
is rich in stellar spectral features which provide abun- 
dance diagnostics for the young stellar populations. The 
difficulty is that these are low-contrast features, usually 
requiring data of higher quality than can be obtained 
with current instrumentation. Nevertheless, it is worth- 
while to examine whether the rest-frame ultraviolet spec- 
tra of the galaxies under study are consistent with the 
abundance trend revealed by Figure 

To this end we have constructed two composite spec- 
tra, each consisting of approximately 30 galaxies, by av- 
eraging the LRIS-B spectra of the galaxies in, respec- 
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Fig. 3. — The observed relation between stellar mass and oxygen abundance at z ~ 2 is shown by the large red circles. Each point 
represents the average value of 14 or 15 galaxies, with the metallicity estimated from the [N II]/Ha ratio of their composite spectrum. 
Horizontal bars indicate the range of stellar masses in each bin, while the vertical error bars show the uncertainty in the [N II] /Ha ratio. 
The additional error bar in the lower rig ht corner shows the additional uncertainty in the N2 calibration itself. The dashed blue line is 
the best-fit mass-metallicity relation of Trcmonti ct al. (2004), shifted downward by 0.56 dex. The metallicities of different samples are 
best compared using the same calibration; we therefore show, with small grey points, the metallicities of the ~ 53,000 SDSS galaxies of 
ITremonti et alJ 120041) determined with the N2 index. Note that the [N II] /Ha ratio saturates near solar metallicity (the horizontal dotted 
line). The blue triangles indicate the mean metallicity of the SDSS galaxies in the same mass bins we use for our sample; using the more 
reliable, low metallicity bins, our galaxies are ~ 0.3 dex lower in metallicity at a given mass. 



tively, the two lower and the two higher mass bins in 
Figure 3. The corresponding mean stellar masses are 
(M*) = 5 x 10 9 M and (M + ) = 7x 10 10 M . The coarser 
mass binning was required to improve the S/N of the 
rest-UV composites to the level where the stellar absorp- 
tion features which are sensitive to metallicity could be 
clearly discerned. The two composite spectra are shown 
in Figure after normalization to the underlyin g stella r 
continua following the prescription bv lRix et alJ {2004). 

Within the spectral region of 1150-1925A covered by 
the composites, the interval near 1400 A is particularly 
suitable for an abundance analysis; we show this por- 
tion on an expanded scale in Figure El The region in- 



cludes two blends of stellar photospheric lines, centered 
at 1370 A and 1 425 A , whose strengths were shown by 
ILeitherer et alJ l|200H) to be mostly sensitive to metal- 
licity. The "1370" feature is a blend of O V A1371 and 
Fe V AA1360 - 1380, while Si III A1417, C III A1427, and 
Fe V A1430 make up the "1425" feature. This portion 
of the spectra also encompasses the Si IV AA1393, 1402 
doublet, which has a broad stellar P-Cygni component 
formed in the expanding atmospheres of late O and early 
B supergiants; superposed on this broad component are 
narrower interstellar absorption lines due to the ambient 
interstellar medium which lies in front of the stars. 
In Figure we have also reproduced synthetic spec- 



ERB ET AL. 



tra generat ed with the Starburst99 code ijLeitherer et al.1 
119991 120011) for the standard case of continuous star for- 
mation and Salpeter initial mass function, using in turn 
the two empirical stellar libraries available in the code, 
built from spectra of OB stars in, respectively, the Milky 
Way (the solar metallicity, or Z = Zq, library) and the 
Magellanic Clouds (the MC metallicity, or Z = 1/3 Zq, 
library). Figure [5] shows that the former is a plausible 
match to the composite spectrum of the higher stellar 
mass galaxies, while the latter is consistent with the com- 
posite UV spectrum of the galaxies in the lower stellar 
mass bins. We now discuss this comparison in more de- 
tail. 

Focusing on the composite spectrum of the most mas- 
sive galaxies (the lower panel in Figure |3J), we find 
that the overall equivalent widths of the "1370" and 
"1425" photosphcric features are in agreement between 
empirical and synthetic spectra. The comparison of the 
Si IV AA1393, 1402 P-Cygni component is complicated 
by the blending with the interstellar lines which, as is 
usually the case, are stronger (and blue-shifted) in star- 
burst galaxies than in the individual stars which make 
up the stellar libraries. Nevertheless, the broad compo- 
nent of the Si IV feature has comparable optical depth 
to the solar metallicity Starburst99 model. Turning to 
the lower mass galaxies (upper panel in Figure , we 
see that all three spectral features are undetected in the 
lower stellar mass UV composite. The Magellanic Cloud 
metallicity model spectrum also shows these lines to be 
much reduced in strength. 

The S/N ratio of the data and the subtlety of the 
spectral features in question limit the above compari- 
son to qualitative statements, and we do not consider 
a more quantitative approach (such as a x 2 analysis, 
for example) to be warranted in the present circum- 
stances. Nevertheless, with only the UV spectra at our 
disposal, we would have concluded that the galaxies with 
a mean stellar mass (M*) = 7 x 10 10 M© have metal- 
licity Z ~ Z Q (or 12 + log(0/H) ~ 8.6), and that 
those with (M*) = 5 x 10 9 M Q have Z < 1/3 Z Q (or 
12 + log (O/H) < 8.1). Given the uncertainties, these 
conclusions are broadly consistent with those deduced 
from our analysis of the [N II] /Ha ratios. 

There are other differences between the two compos- 
ite UV spectra reproduced in Figure 4; for example, the 
interstellar absorption lines are significantly stronger in 
the galaxies with higher stellar mass. The strongest UV 
interstellar lines, Si II A1260, O I A1302+Si II A1304, 
C II A1334, Si II A1526, Fe II A1608, and Al II A1670, 
have rest-frame equivalent widths Wo = 2 — 3 A in the 
(M*> = 7x 10 10 M composite, and W Q = 1.5-2 A in the 
(M*) — 5 x 10 9 M Q co mposite. Since these lines are all 
strongly saturated (e.g. iPettini et al.l 12002]) . the higher 
equivalent widths are much more likely to be due to a 
larger velocity dispersion of the absorbing gas, than to an 
increase in the column densities of the metals. This may 
be related to differences in the star-formation histories of 
the two samples of galaxies; on average, star formation 
has been in progress for longer in galaxies with higher 
values of (Af*) and presumably more kinetic energy has 
been deposited in their interstellar media, stirring the 
gas to higher velocity dispersions. 

Finally, the most obvious difference between the two 
composite UV spectra is in the strengths of the two nebu- 



lar emission lines which fall within our wavelength range, 
Lya and C ill] A1908, which are much stronger in the 
lower stellar mass galaxies. Qualitatively, this difference 
is in agreement with the metallicity and kinematics dif- 
ferences discussed above. The C in] AA1907, 1909 dou- 
blet is similar to [O in] AA5007, 4959 in showing a steep 
inverse dependence on metallicity in the high metallicity 
regime — as the metallicity increases, the temperature of 
the H 11 regions decreases and so do the relative popula- 
tions of the collisionally excited levels from which thes e 
emission lines originate (e.g. IPettini fe Paeell 12004') . 
Thus, it is not surprising to find that C III] A1908 is 
strong in the UV composite with metallicity Z < 1/3Z@, 
and below our detection limit in the galaxies with metal- 
licities close to solar. The dominant factor which de- 
termines the escape of resonantly scattered Lya pho- 
tons from a star-forming galaxy is the velocity disper- 
sion of the interstellar gas through which the photons 
propagate — the larger the velocity dispersion, the smaller 
(and more blue-shifted) is the fraction of Lya photons 
that escape before being absorbed by dust and turned 
into infrared photons (e.g. iMas-Hesse et a 112003) . Thus, 
the stronger Lya flux of the galaxies making up the lower 
stellar mass composite in the top panel of Figure 4 goes 
hand in hand with the smaller velocity dispersion of their 
interstellar media, as discussed above. 

4.2. The Luminosity-Metallicity Relation 

Because the luminosity of a galaxy is far more 
easily determined than its mass, there are a great 
many more luminosity-metallicity (L-Z) relations 
than mass-metallici t y relations in the l iterature (e.g. 
Skill man et al.lll989t IZaritsky et alJll994[ iGarnett et al] 
IT99^TL^maren4e^t*alTl27)0^l^ 2005, to name 

only a few). These correlations span 11 orders of magni- 
tude in luminosity and 2 dex in metallicity, and are seen 
in galaxies of all types. Both the slope and the zeropoint 
of the relation shift depending on the bandpass in which 
the correlation is determined iSalzer et alJ 12*0 05:): it is 
traditional to use the absolute B magnitude, but both 
the slope and the dispersion of the relation decrease 
as wavelength increases to the IR, probably due to 
decreased extinction and the closer correspondence of 
the infrared luminosity to stellar mass. The metallicity- 
luminosity relation has been observed in galaxies at 
redshi fts up to z ~ 1 |Ko bulnick v et al.l*2 003: Lill y et all 
[2001 IKobulnickv fe Kewlevl 120041 iMaier et alJ 120041 
lLlang~et^ni27)04l) . at which point the shifting of most of 
the strong nebular lines into the IR makes spectroscopy 
much more difficult. Most of these studies show that 
the zeropoint of the L-Z relation evolves with redshift, 
so that galaxies of a given luminosity have decreas- 
ing metallicity with increasing redshift. The so- far 
small number of metallicity-luminosity comparisons at 
z > 2 confirm this trend, as high redshift galaxies are 
2-4 mags brighter than local galaxies of comparable 
metallicity (IKobulnickv fe Kooll2000l IPettini et alJl2001t 
Sha plev et al.ll2004D . 

We construct a luminosity-metallicity relation anal- 
ogous to our mass-metallicity relation by dividing our 
sample of 87 galaxies into six bins by rest-frame absolute 
B magnitude Mb, which we determine by multiplying 
the best-fit SED of each object by the redshifted -B-band 
transmission curve. We construct a composite spectrum 
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Fig. 4. — Composite rest-frame UV spectra of galaxies in the two lower (top panel) and two higher (bottom panel) mass bins in our 
sample. The upper spectrum is the average of 30 LRIS-B spectra of galaxies with mean stellar mass (M 4 ) = 5 X 10 9 Mq, while 28 spectra 
contribute to the lower composite for whic h (M+) = 7 X 10 10 Mq. The spectra have been divided by the underlying stellar continuum 
estimated according to the prescription by Rix et al. (2004). Differences in the stellar, interstellar, and nebular lines between the two 
composites are discussed in the text. 



of the galaxies in each bin, and measure the [N II] /Ha 
ratio and determine the oxygen abundance in the man- 
ner described in H3.2I The results are shown in Figure El 
again including SDSS metallicities determined from the 
N2 indicator. It is immediately apparent that the corre- 
lation between luminosity and metallicity is weaker than 
that between mass and metallicity; the trend with lumi- 
nosity is not monotonic, and is not statistically signifi- 
cant. A Spearman correlation test finds a 33% probabil- 
ity that the points are uncorrelated, giving a significance 
of ler. The faintest galaxies do have the lowest metallici- 
ties, however; any appearance of correlation is driven by 
this bin. 

The comparison with the SDSS sample is again made 
difficult by the saturation of the [N II] /Ha ratio around 
solar metallicity, but it is clear that the high redshift 
galaxies have both lower metallicities (subject to the 
caveats discussed in |J3} and higher luminosities than 
most of the local sample. Considering the offset of the 
more reliable lower metallicity bins, we see from Fig- 
ure that the z ~ 2 galaxies are approximately three 
magnitudes brighter than local galaxies with the same 
oxygen abundance. It is difficult to invert the compar- 
ison to determine the difference in metallicity between 
galaxies of a given luminosity in the two samples, how- 
ever, since virtually all of the local galaxies as bright as 
the z ~ 2 sample have solar or greater abundances which 



cannot be accurately determined by the N2 method. In 
order for the best-fit L-Z relation determined by T04 to 
pass through the average luminosity and metallicity of 
our sample, it must be shifted downward by ~ 0.9 dex. 
After allowing for the ~ 0.25 dex systematic difference 
between the metallicity diagnostics used by T04 and here 
(as discussed in we are still left with a shift of 0.6- 
0.7 dex between the local and high redshift samples. Our 
mean values are offset from ot her local L-Z relations by 
amou nts ranging from ~ 0.4 ijSkillman et al.lll989|) to 
~ 1.0 ijLamareille et al.ll2004|) dex. The large offsets be- 
tween the various local relations are due to differences in 
calibration methods and sample selection; the relations 
are not directly comparable, and it is not yet clear which 
of them, if any, provides the most appropriate compar- 
ison for our sample. While it is probably a robust con- 
clusion that the z ~ 2 galaxies are more metal-poor than 
their luminous local counterparts, a better understand- 
ing of systematic effects is required to reliably quantify 
the differences. 

Our results are consistent with previous L-Z determi- 
nations at z > 2, and, moreover, they are not surprising 
given our knowledge of the high redshift sample. Star- 
forming galaxies at z ~ 2 have lower average mass-to- 
light ratios M/L than local galaxies, and the variation 
in M/L at a given rest-fra me optical luminosity can be 
as much as a factor of - 70 (Sh anlev et aTll2005bD . This 
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Fig. 5. — Close-up of the 1400 A region in the two composite 
spectra shown in Figure HI (black histogram). This region con- 
tains two blends of stellar photospheric lines, labeled "1370" and 
"1425" , whose strength is thought to depend primarily on metallic- 
ity, and the Si IV AA1393, 1402 doublet which consists of a broad 
P-Cygni stellar absorption on which narrower interstellar lines are 
superposed. The red histogram shows the Starburst99 spectrum 
for the standard case of continuous star formation with a Salpeter 
IMF and 'Magellanic Cloud' (upper panel) or solar (lower panel) 
metallicities. 




Fig. 6. — The luminosity-metallicity relation at 2 > 2. We have 
divided the sample into six bins by rest-frame absolute B magni- 
tude, and estimated the metallicity in each bin. The symbols are 
the same as in Figure|3] The points are not significantly correlated, 
though the faintest galaxies do have the lowest metallicities. The 
lack of correlation can be understood through the large variation in 
the optical mass-to-light ratio at high redshift. The z ~ 2 galaxies 
are ~ 3 magnitudes brighter than the SDSS galaxies at a given 
metallicity, as estimated by the [N II] /Ha ratio. 




Fig. 7. — Absolute rest-frame B magnitude vs. stellar mass, for 
the individual galaxies in the z ~ 2 sample (large red circles) and 
the SDSS (small grey points). The dashed and dotted lines show 
constant mass-to-light ratios M/L; in the z ~ 2 sample, the total 
range in luminosity is narrow and M/L varies by a factor of ~ 70 
at most luminosities, while for most SDSS galaxies, M/L varies by 
a factor of 2—5 at a given luminosity. 



large variation in M/L explains the lack of correlation 
in the z ~ 2 L-Z relation compared to the local relation. 
Figure [3 shows absolute magnitude Mb plotted against 
stellar mass for the individual galaxies in the z ~ 2 and 
SDSS samples. The z ~ 2 sample is shown by large red 
points; at a given luminosity, M/L varies by up to a fac- 
tor of 70, and the range in luminosity is small, so that 
galaxies with a wide range of stellar masses fall in each 
of the six bins in luminosity. In contrast, M+ and Mb 
are much more tightly correlated in the SDSS sample 
(small grey points); at a given luminosity, most of the 
points arc within a factor of a few in M/L. If the funda- 
mental correlation is between metallicity and mass, these 
differences in M/L would shift the relation to higher lu- 
minosities and dramatically increase the scatter, as we 
have observed. A mass-metallicity correlation is clearly 
more physically meaningful than a luminosity-metallicity 
correlation at high redshifts; a corollary is that the local 
metallicity-luminosity relation is simply a result of the 
strong correlation between mass and luminosity at low 
redshift. While it is likely that an L-Z relation deter- 
mined in the rest-frame IR would show a stronger corre- 
lation, investigation of this possibility must wait until a 
larger sample of high z galaxies with both mid-IR pho- 
tometry and nebular line spectra has been assembled. 

5. THE ORIGIN OF THE MASS-METALLICITY RELATION 

A correlation between gas-phase metallicity and stel- 
lar mass can plausibly be explained either by the ten- 
dency of lower mass galaxies to have larger gas frac- 
tions ijMcGangh de Rlokl 119971 Ifiell fc de Jo^l27if)fl 
and thus be less enriched, or by the preferential loss 
of metals from galaxies with shallow potential wells by 
galactic-scale winds. With the relevant information on 
the star, gas, and metal content of the galaxies, the two 
effects can be differentiated. In the simple, closed-box 



THE MASS-METALLICITY RELATION AT z > 2 



11 



model of chemical evolution with no inflows or outflows, 
the mass fraction of metals Z is a simple function of the 
gas fraction fi = M gas /(M gas + M*) and the true yield 
y, which represents the ratio of the mass of metals pro- 
duced and ejected by star formation to the mass of metals 
locked in long-lived stars and remnants. The true yield 
is a function of stellar n ucleosynthesis, and as in previ- 
ous, similar studie s ('T04.lGarnetti l2002) we assume that 
it is constant (see IGarnetti 1200 J for a discussion). The 
metallicity is then given by 

Z = yln(V/*). ( 2 ) 
This equation can be inverted to determine the effective 
yield y e g from the observed metallicity and gas fraction, 
y e ff = Z/ln(l//i). If the simple model applies, y e g will 
be constant for all masses and equal to the true yield 
y, while a decrease in y c g (either with respect to the 
expected true yield or, more commonly, as a function of 
mass) is a signatur e of outflows or o f dilution by the infall 
of metal-poor gas 

T04 determined the effective yields o f the SPSS galax - 
ies by using the empirical Schmidt law l|Kennicuttl ll998'l . 
which relates the star formation rate per unit area to 
the gas surface density, to estimate gas masses. They 
found lower effective yields in galaxies with lower bary- 
onic masses (the baryonic mass is expected to corre- 
late with the dark matter co ntent, and thus indica te the 
depth of the potential well; iMcGaueh et alj f2000). and 
interpreted this result as evidence for the preferential loss 
of metals in low-mass galaxies. We carry out a similar 
analysis on our sa mple of z ~ 2 galaxies. As described by 
lErb et all ll2006aD . we use a galaxy's Ha luminosity and 
the spatial extent of its Ha emission ru Q (after deconvo- 
lution with the seeing point spread function) to calculate 
the Ha luminosity per unit area £hq- We then estimate 
each galaxy's gas surface density £ gas by combining the 
iKennicuttl l)1998l) relation between star formation rate 
per unit area and gas density with the conversion from 
Ha luminosity to SFR from the same paper as follows: 

£ gas = 1.6 x 10- 27 ( ^—^)°' n M© pc- 2 . (3) 

V erg s 1 kpc / 

The estimated gas mass is then M gas ~ ^gas^nc*- We 
next combine M gas and to obtain an estimate of the 
gas fraction fj,, and compute the mean value of \i in each 
mass bin. Although the considerable (factor of ~ 2) un- 
certainties in both the corrected Ha flux and the galaxy's 
size translate into a significant uncertainty in \x for indi- 
vidual objects, the number of galaxies in each bin allows 
us to determine the mean value of /i to within ~ 10%. 

Galaxies at low and intermediate redshifts show sig- 
nific ant correlations between metallicity a nd extinction 
Ce.g. lCortese et al.ll2005t iMaier et al.ll2005ft . Galaxies at 
z ~ 2 also show a strong correlation between dust ob- 
scuration and stellar mass (a nd hence metallicity), as 
shown bv iReddv et alJ l|2006|) . who use the 24 ^m lu- 
minosity as observed by the Spitzer Space Telescope to 
infer the infrared luminosity L\n and the extinction as 
p arameterized by Ltr Z-Ltiv- One implication of the work 
of IReddv et"aD l|200fl) is that E(B - V) as determined 
from the UV slope may overestimate the extinction cor- 
rection for galaxies with ages less than 100 Myr. This 
effect can be seen in the high mean value of E(B — V) 



for the lowest mass bin in Tabled f° r galaxies with ages 
greater than 100 Myr in the current sample, E(B — V) 
and stellar mass are correlated with 4ct significance. This 
overestimation of the extinction in young galaxies has a 
negligible effect on our current analysis; we estimate that 
it will lead to an overestimation of the SFRs by a typical 
factor of ~ 1.2, considerably less than other uncertain- 
ties. Only the lowest mass bin is affected, since it is the 
only one that contains a significant number of young ob- 
jects; because we derive only an upper limit on the mean 
metallicity of the galaxies in this bin, it does not affect 
our conclusions. 

As in the local universe lIMcGaugh fc de BlokllT997t 
iBell fc de Jondl2"000() . there is a strong trend of decreas- 
ing gas fraction with increasing stellar mass. The low- 
est mass bin in the sample has a mean gas fraction 
(fi) = 0.85, while the highest mass bin has (/x) = 0.22. 
The median value of /x in our sample is ~ 0.5, signif- 
icantly higher than the corresponding median value of 
- 0.2 for the SDSS galaxies considered by T04. The low 
metallicities, low stellar masses and high gas fractions of 
the objects in the lowest mass bin suggest that they are 
young objects just beginning to form stars; this is also 
indicated by the ages from the SED modeling (see Ta- 
blc^) . The correlations be tween age a nd oth er properties 
are discussed in detail bv lErb et al. (20063) m the con- 
text of the comparison of stellar and dynamical masses. 

A consequence of the large gas fractions of the low stel- 
lar mass bins is that the baryonic mass M gas + M+ of the 
galaxies in the sample spans a much smaller range than 
the stellar mass. The difference in mean stellar mass be- 
tween the highest and lowest mass bins is a factor of 39, 
while the difference in mean baryonic mass between the 
same bins is only a factor of six. This relatively small 
range in baryonic mass also limits our ability to detect 
differential loss of metals as a function of mass. We show 
the variations of M*, M gas , and A/bar with metallicity in 
Figure |H] Notably, the increase in baryonic mass along 
our sequence of six bins is driven almost entirely by the 
increase in stellar mass, while the gas mass remains rel- 
atively constant. This is because the average star for- 
mation rate and Ha luminosity vary much less than the 
average stellar mass across the bins; in particular, galax- 
ies in the lowest mass bin have higher than average SFRs. 
This may seem contrary to the tre nd of increasing SFR 
at brighter K magnitudes found bv IReddv et alJ {2005), 
but we account for the difference both because we are less 
likely to detect Ha emission in if -fai nt galaxies unless it 
is especially strong (as discussed bv lErb et al.ll2006albf ) 
and because galaxies are more li kely to be detected i n the 
K-h&ud if they have high SFRs. IReddv et all lj2006|) also 
show that the correlation of stellar mass and bolometric 
luminosity is relatively weak, and that low stellar mass 
galaxies span a wide range in Lbol- For present purposes, 
there is a lower limit on the gas masses we are able to 
detect corresponding to our Ha flux limit, and the ab- 
sence of galaxies with both low stellar masses and low gas 
masses is probably a selection effect. We also note that 
the dynamical masses derived from the Ha line widths 
are a better match to the baryonic masses than the stel- 
lar masses , with Md vn ^> for the objects in the lowest 
mass bin. lErb et all i|2006aft discuss this comparison m 
detail. 

The mean gas fractions and effective yields for each 
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mass bin are given in Table El In contrast to the SDSS 
sample, there is no decline in the effective yield with 
decreasing baryonic mass; in fact we see the opposite, as 
j/ e ff increases slightly with decreasing mass. The points 
in each panel of Figure El show the mctallicity Z in each 
mass bin plotted against the mean gas fraction /z, with 
decreasing gas fraction from left to right to show the 
increase in Z as \i declines. We first discuss the data 
in the context of the closed box model with no inflows 
or outflows. For such a model, lines of constant yield 
are curves given by Eq. El The uppermost (black) curve 
on each of the three plots shows the variation of Z with 
gas fraction in this model for three different values of 
the true yield y; from left to right we show y equal to 
our observed effective yield y c g = 0.008 ~ 0.6 Z Q , solar 
yield y = 0.0126, and a supersolar yield y = 1.5 Z Q . The 
points arc formally consistent with the y ~ 0.6 Zq model 
in the left panel, as the measured value of y e g in each bin 
is within la of the weighted mean y c g = 0.008 in 4 of 
the bins, and within 2a of this value in the fifth (we do 
not consider the lowest mass bin, for which we find only 
an upper limit on Z and y e s)l however, the systematic 
tendency of the lower mass points to fall above the black 
line indicates that the model is not a good fit. A % 2 test 
confirms this, giving a value of 9. 

Given the ubiquitous signature of galactic-scale out- 
flows in the kinematics of the z ~ 2 galaxies, we now 
consider the effects of such outflows on the mctallicities 
of the galaxies. To address this question we modify the 
simple model to include gas outflow at rate M M Q yr" 1 , 
which is a fraction / of the star formation rate; note that 
in this model the outflow rate does not directly depend 
on the mass of the galaxy. We assume that the metallic- 
ity of the outflowing gas is the same as the metallicity of 
the gas that remains in the H II regions in the galaxy. It 
can then be shown that the metallicity is given by 



^ = y(i + /)- 1 in[i + (i + /)( M - 1 -i)] 



(4) 



Observations at low and high redshift suggest that M is 
com parable to the star formation rate , and may be higher 
(e.g. lMartinll999UPettini et alJl2000tlMartinl2 003). The 
curves in each panel of Figure El show the evolution of Z 
with gas fraction for, from top to bottom, f — (black; 
the closed box model), 0.5 (purple), 1 (blue), 2 (green), 
and 4 (red). A decrease in y e g with decreasing gas frac- 
tion such as we have inferred is a general feature of mod- 
els with constant M/SFR; more specifically, from com- 
parison with the points in each panel, it is apparent that 
the data are best matched by a model with supersolar 
yield y ~ 1.5 Zq and a high outflow rate M ~ 4xSFR 
(the red line in the righthand panel). We caution that 
the uncertainties in both mctallicity and gas fraction are 
too large to distinguish between the above models with 
confidence; nevertheless the best-fitting model is not im- 
plausible, as we discuss below. 

An outflow rate of several times the star formation rate 
has already been suggeste d by observat i ons o f the the 
lensed LBG MS1512-cB58: lPettini et"aD |200l found a 
value of M higher than the star formation rate using 
conservative assumptions for the size of the galaxy and 
outflow velocity. This scenario is also in good agreement 
with observations and predictions of the enrichment of 
the intracluster medium (ICM). The ICM contains sev- 



eral times more mass in baryons than is found in the clus- 
ter galaxies themselves, and thus it contains most of the 
meta ls in clusters as well (e.g. lMushotzkv fc Loewensteinl 
1997). Recent models for the enrichment of the ICM 
l|De Lucia et all 120041 iNagashima et all 120051) use feed- 
back from supernovae to transport metals out of galax- 
ies, finding that enrichment of the ICM occurs at high 
redshift, with of order half of the metals produced by 
z ~ 2, and that half or more of the met als in the ICM are 
produ ced in massive galaxies (L > L* , [Nagashima^^^ 
2005; or baryonic mass > 10 10 h~ x M^ . iDe Lucia et al 



2004). These models also require either a top-heavy 
IMF in starbursts or a significantly enhanced yield in 
order to reproduce the observed metal abundances in 
clusters; some such variant IMF is a feature of many 
models of st ar formation and chemical enrichment in 
clusters (e.g. iZepf fc SiUd 119961 iFinoguenov et alJl2003t 
iTornatore et al.ll2004|) . The supersolar yield suggested 
by the best-fitting model shown in Figure El may support 
such a scenario, although we emphasize that our data do 
not requi r e an I MF with more high-mass stars than the 
iChabrierl <)2003|) IMF we use here. Such an IMF, when 
combined with estimates of yields fro m stellar nucleosyn- 
thesis (e.g. iMevnet fc Maederl 12002^ . results in a metal 
yield that is solar or higher. 
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Fig. 8. — The variation of stellar, gas, and baryonic mass in 
each of the six bins with metallicity. Across the observed range in 
oxygen abundance, stellar mass increases strongly, baryonic mass 
increases weakly, and gas mass remains approximately constant. 
We thus see an increase in metallicity with decreasing gas fraction. 
The strong correlation between stellar mass and age means that 
metallicity also increases with the age of the stellar population. 
The error bars at upper left show typical uncertainties in gas and 
baryonic masses and metallicities; uncertainties in stellar masses 
are smaller. 



5.1. Redshift Evolution of the Mass- Metallicity Relation 

With estimates of the gas fractions in hand we now 
return to the ~ 0.3 dex offset between the local and 
z <~ 2 mass-metallicity relations. There are several possi- 
ble reasons for this factor of ~ 2 difference. The magni- 
tude of the offset itself is not well-determined because of 
the uncertainty in our metallicity scale indicated by the 
offsets between the [O III]/H/3 and [N II] /Ha ratios rel- 
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Fig. 9. — The points in each panel show the mean metallicity Z in each mass bin, plotted against the mean gas fraction fi. Gas fraction 
decreases from left to right, to show the increase in metallicity with decreasing gas fraction. The curves illustrate the variation of Z with 
/i for three different values of the true yield: from left to right, y = O.GZq (our observed effective yield), y = Zq, and y = 1.5 Zq. Within 
each panel, the curves show varying values of the mass outflow rate M, parameterized as a fraction / of the star formation rate. From top 
to bottom in each panel, the curves correspond to / = (black; the closed box model), 0.5 (purple), 1 (blue), 2 (green) and 4 (red). The 
data are best matched by a model with supersolar yield and an outflow rate M ~ 4xSFR. The horizontal dashed grey line corresponds 
to solar metallicity, and the small error bar in the lower right corner of each panel shows the systematic uncertainty in the metallicity 
calibration. 



ative to the SDSS galaxies seen in Figure 2] The effects 
of these offsets cannot be quantified reliably without de- 
tailed modeling of a larger sample of galaxies with all four 
emission lines, but meanwhile we can use several tech- 
niques to estimate the possible uncertainty in metallicity 
that may result. We have another method of determin- 
ing the metallicity of the four ga laxies with all four emis - 
sion lines in the 03A^2 index of iPettini fc Paeell {2004): 
Q3iV2 = log {JO II I1A5007/H/3)/([N II]A6583/Ha)}. 
IPettini fc Pagell <|2004f) find that 037V2 varies with oxy- 
gen abundance as 

12 + log (O/H) = 8.73 - 0.32 x 05N2. (5) 

Using this calibration to determine the metallicities of 
the four galaxies with all four lines, we find oxygen abun- 
dances that are 0.17 dex lower on average than those de- 
termined using [N II] /Ha. This is one plausible estimate 
of the metallicity uncertainty due to the differing physical 
conditions in the H II regions of the z ~ 2 galaxies. We 
can obtain another by assuming that the observed shift 
in the [O III]/H/3 and [N II] /Ha diagram is entirely in 
the [N II] /Ha direction; then the relatively shallow slope 
of the relationship between [N II] /Ha and (O/H) leads 
to a maximum offset in metallicity of ~ 0.3 dex. The ef- 
fect of an offset in the [O III]/H/3 and [N II] /Ha diagram 
on metallicity deter minations is considered in detail by 
Sha plev et afl l|2005aD . and we refer the reader to Section 
5.3 of that paper for a discussion. These authors consider 
the effect of a harder ionizing spectrum in high redshift 
galaxies, and conclude that this could plausibly lead to 
a factor of ~ 1.5-2 uncertainty in metallicity. 

Another possible issue is that the SDSS metallicities 
are biased toward the innermost regions of the galax- 
ies observed, where the spectrograph fibers were posi- 
tio ned. According to a recent re-analysis of this effect 
byim ison fc Kewlevl l)2005|) . this fact alone can account 
for a ~ 0.15 dex offset between nuclear and global (i.e., 
integrated over the whole galaxy) metallicities. How- 
ever, this effect may be mitigated by the fact that our 
global spectra are also biased toward the galaxies' cen- 



tral regions, which have the highest surface brightness. 
We conclude that the uncertainty in the metallicity offset 
between the z ~ 2 and local galaxies is approximately a 
factor of two, about the same as the offset itself. 

There are reasons to believe that there may be a real 
evolutionary offset in the mass-metallicity relation, and 
we proceed with the discussion under this assumption. 
As described above, we find that the galaxies in our sam- 
ple have an average gas fraction more than two times 
higher than the galaxies studied by T04, placing them 
at an earlier stage in the process of converting their gas 
to stars. Given this less evolved state, it is not surpris- 
ing that they should have lower gas-phase metallicities. 
Redshift evolution of the mass-m etallicity relation has 
also recently been investigated bv iSavaglio et al.1 (|2005f) 
with a sample of galaxies at 0.4 < z < 1, finding that a 
galaxy of a given stellar mass tends to have lower metal- 
licity at z ~ 0.7 than at z ~ 0. Using the SDSS relation 
of T04, their sample at z ~ 0.7, a nd the z ~ 2 galaxies 
presented bv lShaplev et alJ l|2004() . they develop an em- 
pirical model for the redshift evolution of the relation. 
This model predicts metallicities an average of ~ 0.2 dex 
higher than we observe, though at least some of this offset 
is likely to be due to the different metallicity indicators 
used (it is not possib le to place our sample and that of 
ISavaglio et al.l l)2005j) on the same metallicity scale, as 
an accurate conversion between the two indicators used 
has not yet been established for the samples in question) . 
Such a shift to lower metallicities (or higher masses) with 
increasing redshift can qualitatively be explained via a 
scenario in which star formation takes place over a more 
protracted period in lower mass galaxies. This scenario is 
consistent with our results; we find that the more mas- 
sive galaxies in our sample have smaller gas fractions, 
and are thus likely to exhaust their gas supply before 
the le ss massive galaxies. 

Thepavaglio et alJ l)2005fl model also predicts a steeper 
relation at z ~ 2 than we have observed; this is a con- 
sequence of their assumption that the shape of the T04 
relation remains unchanged but shifts to higher masses, 
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whereas we find that our data is well-approximated by 
the T04 relation shifted to lower metallicities at the same 
stellar mass. In physical terms, the slope of the stellar 
mass-metallicity relation depends on both the yield and 
the presence or absence of outflows (and inflows). As 
shown in Figure El a closed bo x model such as that as- 
sumed bv lSavaglio et alJ 1)2005(1 results in a steep relation 
between metallicity and gas fraction (and stellar mass), 
while the z ~ 2 data are better described by the shal- 
lower slope of a model with significant outflows. 

While comparisons between the metallicities of star- 
forming galaxies at high and low redshift are of obvious 
interest, it is very likely that our sample and that of T04 
do not form an evolutionary sequence. The likely de- 
scendants of the z ~ 2 sample can be identified by com- 
paring their clustering properties, evolved to z ~ 0, with 
those of objects in the local universe; such a compari- 
son shows that early-type galaxies in the SDSS match 
the clustering properties of the z ~ 2 sample, while the 
later-type star-forming objects studied by T04 are too 
weakly clustered to be the descen dants of the z ~ 2 pop- 
ulation l|Adelberger et all20 05b). In this sense it is more 
relevant to compare the metallicities of the z ~ 2 galax- 
ies with local early-type objects than with star-forming 
galaxies at z ~ 0, though such a comparison would un- 
doubtedly be complicated by systematic offsets resulting 
from the very different methods used to determine metal 
abundances in elliptical galaxies (usually absorption fea- 
tures in the integrated spectra of old stellar populations) . 
Broadly speaking, however, most early-type galaxies in 
the SDSS with M* > 10 10 M Q hav e metallicities rangin g 
from Z ~ 0.6 Z Q to Z ~ 1.5 Z @ ijGallazzi et alJ [20051: 
these results are consistent with the probable final stel- 
lar masses and metallicities of the z ~ 2 galaxies. 

Substantial uncertainties in the mass-metallicity rela- 
tion at z ~ 2 remain, of course. It is important to con- 
firm the trend in metallicity with additional measure- 
ments and with abundance indicators that use a broader 
set of emission lines. The absolute values of the abun- 
dances in the sample are uncertain, and the improved 
understanding of the galaxies' physical conditions that 
will result from a larger sample of lines will be essential 
for determining this absolute scale. Furthermore, our 
derivation of the gas masses and gas fractions is indi- 
rect, and assumes that the Schmidt law takes the same 
form at z ~ 2 as in the local universe. This has not 
yet been tested, although the one similar galaxy with a 
direct measurement of the gas mass, the lensed z = 2.7 
LBG MS1512-cB5 8, appears to be c onsistent with the 
local Schmidt law l|Baker et alJl2004Tl . As long as some 
form of the Schmidt law applies, our results will be qual- 
itatively similar. A related question concerns the appro- 
priateness of the gas masses derived from the Schmidt 
law. These represent only the gas associated with cur- 
rent star formation, and are therefore almost certainly an 
underestimate of the total gas masses. In a typical disk 
galaxy today, ~ 4 0% of the gas mass is not included by 
the Schmidt law {Martin fc Kennicuttll2001f) : this frac- 
tion could plausibly be higher in the young starbursts 
in our sample. It is not clear how or if this gas affects 
the metal enrichment and star formation. Given our lack 
of information on this gas, we do not consider it. T04 
discussed these questions of gas masses derived from the 
Schmidt law in somewhat more detail, but arrived at 



similar conclusions. 

Our simple model for the effect of winds on gas-phase 
abundances assumes that the metallicity of the outflow- 
ing gas is the same as the observed abundances in the 
galaxy; this may not be correct. Metal-enhanced hot 
winds have been observed in X-rays in local starbursts; in 
dwarf galaxies, such winds ma y carry away nearly all the 
oxygen produced by the burst {Martin et alJ20 021 . Such 
a metal-enhanced outflow would increase metal losses for 
a given outflow rate; thus it may be possible to produce 
the relative shallow relationship between Z and /i that 
we have observed with a mass outflow rate smaller than 
the value M ~ 4xSFR we find above. 

We also consider the possibility, discussed in M3.1l and 
21 that we have underestimated the stellar masses of 
the galaxies with the smallest M+ by a factor of a few. 
This would primarily affect the objects in the lowest mass 
bins, and would result in a decrease in the gas fraction 
and a decrease in the effective yield. A factor of three 
increase in the stellar mass of the galaxies in the two 
lowest mass bins would result in a decrease in y e ff of ap- 
proximately a factor of two for those bins; this is enough 
to remove the observed trend of decreasing y e & with in- 
creasing stellar mass, and results instead in an approx- 
imately constant effective yield, roughly consistent with 
the closed box model. Given the significant evidence for 
strong outflows in these galaxies, however, we regard the 
best-fit model obtained above as more plausible. 

Finally, the sample of z ~ 2 galaxies considered here 
is by no means complete. The UV-selection technique is 
most likely to miss galaxies t hat are very dusty o r have 
little current star formation; iReddv et alJ l)2005|) show 
that there is relatively little overlap (~ 12%) between the 
UV-selected sample and z ~ 2 galaxies selected by their 
J — K K > 2.3 co lors (Distant Red Galaxies, or DRGs; 
iFranx et alJl2003Tl . This technique selects galaxies with 
a strong optical break, due to either an evolved stel- 
lar p opulation or strong reddening (e.g. iPapovich et alJ 
2005). Little is known about the metallicities of the 
DRGs. A few rest-frame optical spectra of brighter 
DRGs with K < 20 have presented bv lvan Dokkum et alJ 
J2004ll2005Tk these suggest approximately solar metallic- 
ities, although they frequently show evidence for AGN 
or shock ionization. It will be extremely difficult to mea- 
sure metallicities for passively evolving galaxies at high 
redshift, since they lack strong emission lines. Massive 
galaxies that have consumed or expelled most of their gas 
might be expected to be among the most metal-rich ob- 
jects at high redshift. Dusty, rapidly star-forming galax- 
ies are likely to have large gas masses, but their metallic- 
ities will depend on how many generations of stars have 
enriched the gas. There is probably a range of metallic- 
ities within this population. Some confirmation of this 
can be found in Figure 15 of lReHdv et all pOOfih . who 
plot inferred gas fraction against stellar mass for near- 
IR selected galaxies as well as for UV-selected galaxies 
similar to those considered here. All samples considered 
follow the same trend of decreasing gas fraction with in- 
creasing stellar mass, with the DRGs at the high mass 
and low gas fraction end of the spectrum. This suggests 
that galaxies selected by near-IR techniques may also fol- 
low a similar mass-metallicity relation to that discussed 
here, with a possible extension to higher metallicities for 
the oldest and most massive objects. It will be interest- 
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ing to see if this proves to be true when larger samples 
of metallicity measurements become available. 

In summary, the z ~ 2 galaxies show a strong trend in 
oxygen abundance over a range of only a factor of ~ 6 
in baryonic mass. The effective yield increases slightly 
with decreasing mass, rather than decreasing as would 
be expected if low mass galaxies lost a larger fraction 
of their metals to outflows. We conclude that the mass- 
metallicity relation at high rcdshift is primarily a product 
of varying stages of galaxy evolution, caused by the in- 
crease in metallicity as gas is converted into stars and 
metals are returned to the remaining gas. It may be 
modulated by metal loss from strong outflows in galax- 
ies of all masses, which results in a shallower increase of 
metallicity with decreasing gas fraction than predicted 
by the closed box model. There is no evidence for pref- 
erential loss of metals from lower mass galaxies, as has 
been inferred in the local universe, although the small 
mass range spanned by our sample limits our sensitivity 
to such an effect. 

6. SUMMARY AND CONCLUSIONS 

We have used composite Ha + [N II] spectra of 87 star- 
forming galaxies at z > 2, divided into six bins by stellar 
mass, to study the correlation between stellar mass and 
metallicity at high redshift. Our conclusions are summa- 
rized as follows. 

1. There is a strong correlation between stellar mass 
and metallicity at z > 2, as the quantity 12 + log(0/H) 
increases monotonically from < 8.2 for galaxies with 
(M*) = 2.7 x 10 9 M Q to 8.6 for galaxies with (AQ = 
1.0 x 10 11 Mq. The relation is offset by - 0.3 dex 
from the local mass-metallicity relation, in the sense that 
galaxies of a given stellar mass have lower metallicities at 
high redshift. The absolute values of the oxygen abun- 
dances are uncertain, but the trend is unlikely to be due 
to systematic effects such as AGN contamination, hidden 
stellar mass, or variations in the ionization parameter of 
the H II regions. 

2. Rest-frame B-band luminosity and metallicity are 
not significantly correlated. The z <~ 2 galaxies are 
systematically brighter than local star-forming galaxies 
spanning the same range in stellar mass, showing that 
they have smaller mass-to-light ratios M/L. At a given 
metallicity, local galaxies are ~ 3 magnitudes fainter 
than the z ~ 2 sample. The known large scatter in 
the rest-frame optical M/L at z ~ 2 accounts for the 
lack of correlation between luminosity and metallicity, 
and indicates that the correlation with stellar mass is 
(as expected) more fundamental. 

3. We use the Schmidt law to estimate the gas masses 
and gas fractions of the z ~ 2 galaxies, finding that the 
gas fraction increases substantially with decreasing stel- 
lar mass. The lowest mass bin in the sample has a mean 
gas fraction of 85%, while the highest stellar mass bin has 
a fraction of 20%. Our median gas fraction is ~ 50%, 
as compared to ~ 20% in local star-forming galaxies. 
Galaxies with low stellar masses, low metallicities and 
high gas fractions also have young ages. A consequence 



of the trend in gas fraction with stellar mass is a much 
smaller range in baryonic mass (a factor of 6) than stellar 
mass (a factor of 39) across the sample. 

4. The observed metallicities and gas fractions allow an 
estimate of the effective yield in each mass bin. In con- 
trast to the results of a similar study in the local universe, 
we find no decrease in y e g at low masses; instead, y e ff in- 
creases slightly with decreasing mass. Qualitatively, such 
an increase is a feature of models in which galaxies of all 
masses lose metals from outflows. More quantitatively, 
comparison with simple models shows that the observed 
variation of metallicity with gas fraction is best described 
by a model with supcrsolar yield and an outflow rate 
~ 4 times higher than the star formation rate. We con- 
clude that the mass-metallicity relation at high redshift is 
caused by the increase in metallicity as gas is converted 
to stars, and may be modulated by strong outflows in 
galaxies of all masses. Our ability to detect differential 
metal loss as a function of mass is limited by the small 
range of baryonic masses spanned by the galaxies in the 
sample, but there is no evidence for preferential loss of 
metals from low mass galaxies as is suggested locally. 

Much remains to be done in order to improve upon the 
substantial uncertainties inherent in the present work. 
Independent measurements of the gas masses of galax- 
ies at high redshift, though very difficult, are essential 
to determine whether our derived gas fractions and ef- 
fective yields are valid. Additional metallicity measure- 
ments, based on other indicators that use a wider set of 
emission lines, are needed to confirm the trend with stel- 
lar mass revealed by the A^2 index, to provide a better 
understanding of the physical conditions in the H II re- 
gions, and to establish a secure absolute calibration of 
the metallicity scale. Further investigation of the ques- 
tion of differential metal loss from outflows at high red- 
shift requires observations of fainter galaxies with smaller 
potential wells, to expand the dynamic range in baryonic 
mass. We anticipate that all of these measurements will 
greatly increase our understanding of the interplay be- 
tween stars and gas, within and outside of galaxies, at 
high rcdshift. 
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Table 1. Mean Stellar Masses and Stellar Population Properties 
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Age h 
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(10 10 Mq) 


(M yr" 1 ) 
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15 


2.36 


24.20 ± 0.72 


21.51 ± 0.69 


-21.27 ± 0.73 


0.27 ± 0.15 


55 ± 52 


180 ± 276 


0.19 ± 0.09 


2 


15 


2.27 


23.96 ± 0.81 


21.03 ± 0.61 


-21.66 ± 0.59 


0.71 ± 0.17 


24 ± 10 


448 ± 374 


0.14 ± 0.07 
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15 


2.21 


24.01 ± 0.72 


20.86 ± 0.47 


-21.82 ± 0.58 


1.5 ± 0.3 


30 ± 21 


968 ± 944 


0.11 ± 0.08 


1 


14 


2.28 


23.97 ± 0.62 


20.54 ± 0.49 


-22.15 ± 0.51 


2.6 ± 0.4 


28 ± 17 


1026 ± 864 


0.16 ± 0.07 


5 


14 


2.21 


23.93 ± 0.60 


20.16 ± 0.38 


-22.33 ± 0.51 


4.1 ± 0.6 


47 ± 30 


1311 ± 790 


0.17 ± 0.09 


6 


14 


2.26 


24.32 ± 0.55 


20.05 ± 0.40 


-22.30 ± 0.44 


10.5 ± 5.4 


47 ± 24 


2409 ± 591 


0.19 ± 0.06 



a Number of galaxies in bin 

b Mean and standard deviation of zh q . 

c Mean and standard deviation of TZ magnitude. 

d Mean and standard deviation of K a magnitude. 

c Mean and standard deviation of absolute magnitude Mb, determined as described in the text. 
Mean and standard deviation of stellar mass from SED fitting; we use a lChabrierl <2003T) IMF. 

g Mean and standard deviation of SFR from extinction-corrected Ho luminosity, including a factor of two aperture correction determined from narrow-band imaging and comparison of 
the i\-band continuum with broad-band magnitudes. Because the dispersion is large we also quote the semi interquartile range for bins 1-6 respectively: 29, 7, 13, 12, 13 and 15 Mq yr -1 . 

h Mean and standard deviation of best-fit age from SED fitting. Because the dispersion is large we also quote the semi interquartile range for bins 1-6 respectively: 123, 184, 677, 490, 
541 and 375 Myr. 

'Mean and standard deviation of best-fit E(B — V) from SED fitting. 



Table 2. Oxygen Abundances and Gas Fractions 
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Stellar Mass a 
(10 10 M©) 


F Ha h 

(10~ 17 erg s" 1 cm -2 ) 


^[NII] b 
(10~ 17 erg s _1 cm" 2 ) 


N2 C 


12 + log(0/H) d 


M bar c 
(10 10 M ) 




y e ft s 
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0.27 ± 0.15 


20.5 ±0.5 


< 1.2 


< -1.22 


< 8.20 


2.7±1.7 


0.85 ± 0.12 


< 0.027 


2 
3 
4 
5 
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0.71 ± 0.17 

1.5 ± 0.3 

2.6 ± 0.4 
4.1 ± 0.6 
10.5 ± 5.4 


13.9 ±0.3 
18.7 ±0.4 
15.9 ±0.4 
24.3 ±0.5 
27.0 ±0.4 


1.4 ±0.2 
2.7 ±0.3 
2.6 ±0.3 

5.3 ±0.4 

7.4 ±0.3 


-1 00+ 07 
± ' uu -0.09 

-U.Si>_ 0i06 

n yo + 0.05 

-0.78_ 05 

-0 66+ 03 
u - DD -0.04 

U - OO -0.02 


o no+0.07 
o - oo -0.07 
8 42+ - 06 

8 - 4b -0.05 

8 52+ - 06 
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o cro+0.06 
»-3»_0.04 


2.1±0.6 
3.2±1.1 
4.0±0.9 
6.6±1.1 
13.1±5.6 


0.63 ± 0.12 
0.48 ± 0.19 
0.33 ± 0.12 
0.36 ± 0.10 
0.22 ± 0.11 


0.013±0.003 
0.010±0.002 
0.007±0.001 
0.009±0.002 
0.007±0.001 



a Mean and standard deviation of stellar mass from SED fitting; we use a lChabrierl <2003T) IMF 
b Fluxes of Ha and [N IIJA6584 from the composite spectra. 
c N2 = log(F [ml] /F Ha ) 
Oxygen abundance from iV2, using the calibration of lPettini fc Pagell 120041) . 

c Mean and standard devation of the baryonic mass M gas + M*, with gas masses determined from the Schmidt law as described in the text. 



f Mean and standard deviation of the gas fraction fj, - 
SEffcctive yield y cft = Z/ln(l//x). 



M gas /(M gas +M+). 
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